[25-Q4 Ecosystem Building] Model Migration - R&D Efficiency Department - Model Training - Support TResNet training on Cifar100 in the PyTorch framework #461

Open
x0212wwl wants to merge 1 commit into Tecorigin:main from x0212wwl:TResNet

x0212wwl commented Jan 9, 2026

● Current software stack version:
[image]

● Source reference link: https://github.com/huggingface/pytorch-image-models
● commit id: x0212wwl@ https://github.com/x0212wwl
● Working directory: PyTorch/build-in/Classification/TResNet/
● Training content: use 1 TECO_AICARD_01 chip to support training TResNet on the Cifar100 dataset in the PyTorch framework.
● Run script:
SDAA_VISIBLE_DEVICES=12 python weloTrainStep.py --name train --arch tresnet --batch_size 32 --datapath ../data --dataset cifar100 --steps 100 --epochs 100 --print_freq 1 | tee sdaa.log


● Loss over 100 iterations:
[image]

MeanRelativeError: -0.01588426280949183
MeanAbsoluteError: -0.068266
Rule,mean_absolute_error -0.068266
pass mean_relative_error=-0.01588426280949183 <= 0.05 or mean_absolute_error=-0.068266 <= 0.0002
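
For readers who want to reproduce the check above, here is a minimal sketch of how such a pass rule could be evaluated over two per-iteration loss series. The comparison script is not shown in this PR, so everything here is an assumption: the function name `compare_losses` is hypothetical, the errors are taken as signed means (inferred only from the negative values in the log), and the direction of the difference (device minus reference) is a guess. Only the thresholds 0.05 and 0.0002 and the "or" between the two conditions come from the log line.

```python
from statistics import fmean


def compare_losses(device_losses, reference_losses, rel_tol=0.05, abs_tol=0.0002):
    """Hypothetical re-creation of the pass rule printed in the log.

    Assumptions (not confirmed by the PR):
      * errors are signed means, not means of absolute values;
      * the difference is device loss minus reference loss;
      * reference losses are non-zero.
    """
    if not device_losses or len(device_losses) != len(reference_losses):
        raise ValueError("loss series must be non-empty and of equal length")

    # Signed per-iteration differences between the two runs.
    diffs = [d - r for d, r in zip(device_losses, reference_losses)]

    # Mean of the signed differences (labelled "MeanAbsoluteError" in the log).
    mean_absolute_error = fmean(diffs)

    # Mean of the signed differences relative to the reference loss.
    mean_relative_error = fmean(
        [diff / ref for diff, ref in zip(diffs, reference_losses)]
    )

    # The run passes if either threshold is satisfied, as in the log line.
    passed = mean_relative_error <= rel_tol or mean_absolute_error <= abs_tol
    return mean_relative_error, mean_absolute_error, passed


if __name__ == "__main__":
    # Illustrative values only, not taken from the PR.
    mre, mae, ok = compare_losses([4.0, 3.0], [4.5, 3.5])
    print(f"mean_relative_error={mre} mean_absolute_error={mae} pass={ok}")
```

Under these assumptions, any run whose loss is lower on average than the reference passes trivially, because a negative signed error is always below both thresholds. A stricter variant would compare the absolute values of the errors instead.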
